Picture for Chen Li

Chen Li

Beihang University

MBench: A Comprehensive Benchmark on Memory Capability for Video World Models

Add code
May 30, 2026
Viaarxiv icon

DRM: Diffusion-based Reward Model With Step-wise Guidance

Add code
May 25, 2026
Viaarxiv icon

ClueAegis: Heuristic-to-Reasoning Cognitive-skill Learning for Unified Evidence-based Synthetic Image Detection

Add code
May 24, 2026
Viaarxiv icon

PathNavigate: A Training-Free Pathology Agent with Surprise-Guided Scan and Shared Slide Memory for Whole-Slide Image VQA

Add code
May 22, 2026
Viaarxiv icon

Thinking in Scales: Accelerating Gigapixel Pathology Image Analysis via Adaptive Continuous Reasoning

Add code
May 19, 2026
Viaarxiv icon

Breaking Dual Bottlenecks: Evolving Unified Multimodal Models into Self-Adaptive Interleaved Visual Reasoners

Add code
May 14, 2026
Viaarxiv icon

ORCE: Order-Aware Alignment of Verbalized Confidence in Large Language Models

Add code
May 12, 2026
Viaarxiv icon

Improving Temporal Action Segmentation via Constraint-Aware Decoding

Add code
May 11, 2026
Viaarxiv icon

When to Re-Commit: Temporal Abstraction Discovery for Long-Horizon Vision-Language Reasoning

Add code
May 11, 2026
Viaarxiv icon

OmniHuman: A Large-scale Dataset and Benchmark for Human-Centric Video Generation

Add code
Apr 20, 2026
Viaarxiv icon